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ABSTRACT 



X-ray and Sunyaev-Zel'dovich effect observations can be combined to measure 
the distance to clusters of galaxies. The Hubble constant, Hq, can be inferred 
from the distance to low-redshift clusters. With enough clusters to measure the 
Qh| redshift-distance relation out to a redshift z ~ 1, it may be possible to determine 

O ' the total matter density, VLq, and the cosmological constant, Aq, as well. If the 

^ \ intracluster gas distribution is not spherical, but elongated by a factor of Z along 

the line of sight, the inferred distance is increased by Z, and Hq is decreased by 
the same factor. Averaging the inferred value of Hq over a sufficiently large 
^ ' sample of clusters can reduce any systematic bias due to cluster shapes, provided 

c3 \ the clusters are selected without any preferred orientation. Even so, elongation 

contributes significantly to the variance in the measured distances and in the 
inferred value of Hq. 

With the addition of gravitational lensing observations, it is possible to infer 
the three-dimensional shape of an individual cluster, provided the gas is in hydro- 
static equilibrium. We demonstrate a specific method for finding the shape and 
correcting the measured distances to individual clusters. To test this method, 
we apply it to artificial observations of simple model clusters. We base the arti- 
ficial X-ray observations on the Chandra X-ray Observatory. For the SZ effect, 
we assume modest improvements over current observations at the Owens Valley 
Radio Observatory. We recover the true distances to each of our clusters without 
detectable bias, and with statistical errors due to measurement uncertainties of 
4 to 6%. 
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Subject headings: cosmic microwave background — distance scale — galaxies: 
clusters : general — gravitational lensing — X-rays: galaxies: clusters 

1. Introduction 

The discovery of X-ray emission from clusters of galaxies revealed that clusters are filled 
with ionized gas. Not long after this discovery, Sunyaev & Zel'dovich (1972) pointed out that 
the electrons in this gas should also scatter photons from the Cosmic Microwave Background 
(CMB), and that the thermal motion of the electrons would produce a detectable shift in 
the CMB spectrum. This shift is known as the Sunyaev-Zel'dovich (SZ) effect. Together, 
SZ effect and X-ray observations of the intracluster gas determine the physical size of the 
cluster along the line of sight. Comparing this scale with the apparent angular size of the 
cluster yields a direct measurement of the angular diameter distance, assuming the cluster 
is spherical (Cavaliere, Danese, & de Zotti 1977; Gunn 1978; Silk & White 1978). This 
geometric estimate is independent of the usual distance ladder. Using cluster distance and 
redshift measurements for low redshift clusters, the Hubble constant, Hq, can be determined. 
Measurements of the redshift-distance relation at redshifts up to z ~ 1 would constrain the 
matter density, Qq, and and the cosmological constant, Aq, in units of the critical density 
(Carlstrom et al. 2001). 

We know from X-ray images that clusters are often not round in projection, and therefore 
cannot be spherical (see, e.g., Mohr et al. 1995). A number of authors have investigated the 
effect of asphericity on the inferred value of Hq, either with simulations (Inagaki, Suginohara, 
& Suto 1995; Roettiger, Stone, & Mushotzky 1997; Yoshikawa, Itoh, & Suto 1998) or analytic 
ellipsoidal models of the gas (Cooray 1998, 2000; Hughes & Birkinshaw 1998; Sulkanen 1999; 
Puy et al. 2000). For an individual cluster, the estimated distance will differ from the actual 
distance by a factor of the ratio, Z, of the size of the cluster along the line of sight to its size 
in the plane of the sky. The usual solution to this problem has been to suggest averaging the 
inferred values of the Hubble constant over a sample of clusters. Here, we propose a different 
approach. We will demonstrate that it is possible to infer the three-dimensional shape of an 
individual cluster, and recover the true distance. 

In its simplest form, the idea behind the distance determination is as follows. The 
SZ effect is caused by Thompson scattering of CMB photons by electrons. It is therefore 
proportional to the column density, Ne = neAl, where ng is the electron number density 
and Al is a measure of the path length of a given line of sight through the cluster. The 
X-ray emission is due to Bremsstrahlung from electron-ion collisions, so the X-ray surface 
brightness, E^, is proportional to n^A/. Squaring ngA/ and dividing by rig A/ from the X- 
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ray observations, we can eliminate the electron density to find the path length. Finally, by 
comparing this path length with the angular extent of the cluster, wc can infer the angular 
diameter distance, to the cluster. For clusters at redshift 2; -C 1, we can calculate the 
Hubble constant, Hq — cz/D. 

We have neglected the fact that both the SZ effect and the X-ray emissivity depend on 

the temperature of the electrons as well as their density. We describe the method in more 
detail, taking temperature into account, in §2. However, this simplified picture suffices to 
illustrate the effect of cluster shapes on the inferred distance. The method assumes that 
the cluster is spherical, so that the path length along the line of sight and the width of 
the cluster perpendicular to the line of sight are the same. If the actual gas distribution is 
not spherical, this assumption can introduce a systematic error in the inferred distance. If 
the gas distribution is elongated by a factor of Z along the line of sight, this will increase 
the measured SZ effect and X-ray surface brightness by Z, without changing the apparent 
angular size of the cluster. As a result, the inferred distance will be increased by a factor of 
Z relative to the true distance, and Hq — cz/D will be underestimated by Z. 

It has generally been assumed that averaging the inferred values of the Hubble constant 
over different cluster orientations at random would eliminate any systematic bias. In fact, 
the validity of this assumption seems to depend on both the intrinsic shapes of clusters 
(whether prolate, oblate or triaxial) and on the choice of projected axis used to establish the 
angular scale (Cooray 1998, 2000; Sulkancn 1999), thoTigh the remaining systematic errors 
are relatively small (3 — 10%) if the semi-major projected axis is used. 

In addition, removing the systematic effect of elongation requires averaging over an 
unbiased sample of clusters. If clusters are chosen based on X-ray surface brightness or 
SZ effect amplitude (or eliminated based on non-detections), clusters elongated along the 
line of sight will be favored (Birkinshaw, Hughes, & Arnaud 1991). Clusters elongated due 
to recent mergers may be favored because of their higher X-ray luminosity (Roettiger et 
al. 1997). Some authors (Roettiger et al. 1997) have suggested selecting clusters without 
evidence of recent mergers, but even this may introduce its own bias if the merger signatures 
used are not independent of orientation. 

Finally, even with an unbiased sample, asphericity contributes roughly 15% to the rms 
scatter in the value of Hq from individual clusters (Inagaki et al. 1995; Roettiger et al. 1997; 
Sulkanen 1999). Thus measurements are required for a large (~ 25) cluster sample. This 
may limit more detailed studies of the redshift-distance relation to estimate and Aq. Some 
evidence (Cooray 1998) suggests that a large part of the scatter in measurements of Hq from 
X-ray and SZ effect observations is due to variations in cluster shapes. 



-4- 



If we had a method of determining the three-dimensional shape of individual clusters, 
we could correct for the effect of elongation on the distance. This would enable us to reduce 
the scatter in the inferred redshift-distance relation. In addition to reducing the number of 
clusters needed to achieve a given level of precision, this might also allow us to study other 
effects (to quantify any variations in the dumpiness of the intracluster gas from one cluster 
to another, for instance). 

The degeneracy between the distance, D, and the elongation factor, Z, is intrinsic 
to observations of projected cluster properties. Therefore, to measure elongation without 
prior knowledge of we need an additional theoretical constraint. An obvious choice is 
hydrostatic equilibrium. This assumption in turn introduces a new unknown, the cluster 
mass distribution. To compensate, we must add an additional observational constraint, the 
projected mass distribution as measured by gravitational lensing. We will prove that the 
assumption of hydrostatic equilibrium, together with lensing observations, is sufficient to 
break the degeneracy and allow us to recover both the elongation and the distance. 

Such an approach has been suggested by Zaroubi et al. (1998, 2001). Rebhnsky (2000) 
has also discussed a method for reconstruction of the three-dimensional structure of clusters. 
The perturbative approach of Dore et al. (2001) may also be useful in inferring asphericity. 

Cooray (1998) has suggested a somewhat different way of breaking the same degeneracy. 
While it also combines X-ray, SZ effect, and lensing observations, it is based on the assump- 
tion of a universal (and measurable) baryon fraction for clusters, rather than on that of 
hydrostatic equilibrium. 

We will demonstrate that our method can successfully recover the shape and distance 
with high accuracy when applied to simple analytical models of clusters. We first test the 
method on clusters which are axisymmetric about line of sight, and then on triaxial clusters 
of arbitrary orientation. In the future, we will test its ability to do the same when applied 
to simulated clusters, which more closely resemble real ones, at least in their complexity and 
asymmetry. We should note that one advantage of the traditional method of inferring Hq 
from SZ effect and X-ray observations is that it does not depend on hydrostatic equilibrium. 
Deviations from hydrostatic equilibrium in real clusters may introduce a systematic error in 
the value of Hq inferred from any method which does rely on this assumption. It is essential 
to apply our method to simulated clusters to attempt to quantify any such systematic error. 

In §2, we will explain the elongation-distance degeneracy in more detail. By using 
ellipsoidal potentials, we will prove analytically that the hydrostatic equilibrium assumption 

and weak lensing observations are sufficient in principle to break the degeneracy in the case 
of a cluster axisymmetric about the line of sight. In practice, the actual model we use to 
infer the elongation from artificial observations uses ellipsoidal mass distributions. We will 
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formulate this method in §3 for clusters with arbitrary triaxial shapes. 

In §4, we will introduce the simple analytic clusters we use to test the de-projection 
method. We will describe how we contruct artificial observations of these clusters, and 
specify our assumptions about the uncertainties in these observations in §5. Some addi- 
tional numerical details of the calculation are described in §6. We apply our de-projection 

method to these clusters and present our results in §7. We discuss the uncertainties in the 
inferred distance, both statistical and systematic in §8. Finally, in §9, we will summarize 
our conclusions and highlight directions for future work. 



2. The Elongation-Distance Degeneracy 

2.1. The Spherical Case 

To understand how the distance can be determined from X-ray and SZ effect mea- 
surements, consider the observables. The Sunyaev-Zel'dovich effect is due to the Compton 
scattering of microwave background photons by the electrons in the intracluster plasma, 
with a probability per unit path length, dl, equal to the Thomson cross-section, ax, times 
the electron density, Ue- The photon frequency, u, follows a random walk with {dv/vf per 
scattering equal to the line of sight velocity dispersion (vj^Qg) — kTe/nie of the electrons, 
divided by c^. The variance (Su/i/)^ integrated along the path through the cluster is given 
by the Compton y-parameter, 

y= \~~] = I dlrieTe, (1) 

\u J ruec^ J 

where k is Boltzmann's constant. Observationally, what is measured is not the frequency 
shift of individual photons, but the change in the black body spectrum of the CMB. This is 
commonly expressed in terms of the shift in the brightness temperature at a given frequency, 
{AT/T)cmb{^), which is proportional to y: 

(^) (ly) = fihu/kTcMB, Te) y, (2) 

\ ^ / CMB 

where f ^ —2 (plus relativistic corrections) when hu/kTcuB < 1- When we refer to the SZ 
effect, we mean, technically, the thermal SZ effect due to the isotropic, thermal motions of 
the electrons. If the cluster has a nonzero peculiar velocity with respect to the Hubble flow, 
this will cause a systematic frequency shift known as the kinetic SZ effect. 



The X-ray emissivity is due to Bremsstrahlung from electron-ion collisions, so it is 
proportional to the square of the electron density. Modern X-ray telescopes, such as the 
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Chandra X-ray Observatory (Chandra) and XMM-Newton, can measure the X-ray surface 
brightness as a function of photon energy, with spectral resolving power E / /S.E ~ 20 - 40 
at 6 keV. Thus, they can measure the X-ray surface brightness per unit energy interval. 



where z is the redshift of the cluster, E is the photon energy in the observer's frame, E' = 
£^(1-1-2;) is the rest frame photon energy, and dA{Te, E')/dE' is the X-ray spectral emissivity. 

Both the X-ray surface brightness and the temperature decrement can be mapped as 
a function of angular position, 0, on the sky. If the electron density and temperature 
are spherically symmetric, then these spatially resolved observations can be de-projected. 
Because the observations are functions of angular position, 9^ the de-projection determines 
DrieTf. and Dnl dA{Te, E') / dE' as functions of r/D, where r is the radius and D is the 
angular diameter distance to the cluster. Given X-ray observations with sufficient spatial 
and energy resolution, the temperature profile, Te{r/D), can be inferred. This allows us 
to eliminate the temperature-dependent factors, to obtain Drie from (AT/T)cmb and Dnl 
from Ex- The ratio of the square of the SZ effect measurement to the X-ray measurement 
gives the angular diameter distance. 



Now, suppose that contours of constant rig and Tg are not spherical, but are instead 
elongated by a factor of Z along the fine of sight. Then, the integration path length and 
thus the normalization of the projected observables will be increased by a constant factor 
of Z. The shapes of the measured profiles, (AT/T)cmb(^) and T,x{0) will be unchanged, so 
there will be no obvious sign that the cluster is not spherical. If the data are analyzed under 
the assumption of spherical symmetry, the inferred distance will be DZ'^/Z = DZ, since the 
square of the temperature decrement is being compared to the X-ray surface brightness. 

Any method which relies on the normalization of quantities integrated along line of 

sight suffers from this degeneracy; only the product of the distance and the elongation can 
be determined. Observations of additional projected quantities do not help. The only way to 
break the degeneracy is to postulate an additional constraint among the cluster properties. 
The hydrostatic equilibrium condition. 



provides such a constraint between the pressure, p, the gas density, pgas, and the gravitational 
potential, 0. 




(3) 



2.2. The Elongation-Distance Degeneracy 



(4) 
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Of course, while hydrostatic equihbrium adds a constraint, it also introduces a new 
unknown, the gravitational potential, 0, of the cluster. Fortunately, this extra degree of 
freedom can in turn be constrained by weak gravitational lensing observations, as we will 
see in §2.3. Then, in §2.4, we will show how adding weak lensing data and the hydrostatic 
equilibrium condition allows us to break the degeneracy. 



The gravitational potential of a cluster deflects light rays from sources behind the cluster. 
The lensing effect is determined by the convergence, k, which is the ratio of the projected 
surface density, of the lens at angular position 0, to the critical surface density. 



where Dg, Di, and Dig are angular diameter distances to the source, to the lens, and from the 
lens to the source, respectively. Since each of the angular diameter distances is proportional 
to Hq^, Ecr is proportional to Hq. However, the product A^cr which we will encounter 
depends only on the source and lens redshifts, and on the cosmological parameters (Qq, Aq, 
etc.) which determine the shape of the redshift-distance relation, but is independent of Hq. 

The convergence is also equal to (1/2) V^V'^ where 



is called the lens potential (Schneider, Ehlers, & Falco 1992). 

The deflection of light rays distorts the images of background galaxies, magnifying them 
in both area and flux, and changing their shapes. In the weak lensing limit, where the second 
partial derivatives {ip^ijl <^ 1, these effects must be measured statistically by averaging over 
many background galaxies. In this limit, the images are magnified by 1 + 2k, while the 
distortion in their shapes is characterized by the 2-component shear, 7^, where 



2.3. Gravitational Lensing 



Dg 



(5) 




(6) 



71 = 2^^'^^ ~ "^'22^ 



(7) 



and 



72 = ^,12- 



(8) 



Direct measurements of the magnification in the weak lensing limit are difficult, though 
several techniques have been suggested (Bartelmann & Narayan 1995; Broadhurst, Taylor, 
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& Peacock 1995) and at least one cluster has been detected and mapped by such techniques 
(Taylor et al. 1998). More frequently, the shear is measured from the galaxies shapes. Kaiser 
& Squires (1993) demonstrated a method for reconstructing k, up to a constant offset, from 
maps of the shear (see Mellier 1999, for a review). The ambiguity of this offset is known as 
the mass sheet degeneracy. 

We should note that equation (5) assumes a single, known source redshift, z^. In the 
weak lensing limit, can simply be replaced with its average over the redshifts of the 
sources (Kaiser & Squires 1993; Seitz & Schneider 1997). Still, it is necessary to estimate 
the source redshift distribution. For zi <^ Zs^ this task is simplified by the fact that the ratio 
Dg/Dia becomes independent of the source redshift. More generally, several methods have 
been suggested for estimating the source redshift distribution from the lensing data itself 
(Small, Ellis, & Fitchett 1994; Kneib et al. 1994, 1996; Bartelmann & Narayan 1995). 



2.4. Breaking the Degeneracy 

Equation (4) implies that the contours of constant pressure coincide with the cluster 
isopotentials. Taking the curl of both sides, we see that Vpgas x V0 = 0, so the isopotentials 
are also contours of constant gas density. Finally, since p = PgagkT / fj^rrip, where prrip is 
the mean mass per particle, they are contours of constant temperature as well. Thus, for 
a potential with a single minimum, both pgas and T are functions of (f). Then, the balance 
between pressure and gravity perpendicular to the isopotentials can be rewritten as 

^ = -Pgas: (9) 

or 

dT dlogUe ^ iinip 

d(f) d(P k ' ^ ^ 

Since the observations are made as a function of angular position, 0, it is convenient to 
define a coordinate ( — l/D. Then, we can define a three-dimensional position x = r/D — 
{d, C) in radians. We also define the ellipsoidal equivalent, m, of a radius, by 

m'^=x-E'^-x, (11) 

where E'^ is a symmetric, positive-definite matrix. Surfaces of constant m form a family of 
similar, concentric, coaxial ellipsoids. In the case of a cluster with isopotentials elongated 
along the line of sight by Z, 

m = ^Jd^ + CyZ^. (12) 
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The gas properties are functions of and thus of m. 

If we apply a spherical de-projection to such an elongated cluster, we will obtain DZrieT^ 
and DZnl dA{Te, E')/dE' as functions of m. Again, we can extract the temperature profile, 
Te{m), which depends only on the ratio between the X-ray surface brightness in different 
bands, and is therefore independent of Z. Rewriting equation (10) in terms of derivatives 
with respect to m, we see that 

dT ^dlogUe /imp d(j) 
dm dm k dm 

Both terms on the left-hand side of this equation can be inferred from the X-ray observa- 
tions, independent of D and Z, so we can integrate to find 0(m) independent of D or Z. 
Equation (6) relates to ip, so we can predict 

JdC4> {m{d, 0) = I / d{l/Di) <p = ^^^^^m (14) 

as a function of 6. As we noted in §2.3, T^c^Di depends upon the source and lens redshifts 
and on cosmology, but is independent of Di and Hq, so we can predict ijj/Z. Taking the 
appropriate partial derivatives with respect to 9i, we can calculate both k/Z and ^y/Z. 
Comparing with k or 7 from lensing observations, we can solve for the elongation, Z. Either 
K or 7 is sufficient to fix Z. Therefore, in principle, Z can be determined from weak shear 
measurements alone, without the need to infer k or to resolve the mass sheet degeneracy. 

Thus, given the X-ray and weak lensing observations, the hydrostatic equilibrium con- 
dition, the cosmology and the distribution of source redshifts, we can infer the elongation, Z. 
This breaks the degeneracy. RecaUing that the X-ray and SZ effect observations determine 
DZ, we see that we can infer D and thus Hq. 

3. Cluster De-projection 

Finding the three-dimensional cluster properties from the two-dimensional observations 
is an inverse problem. Following the usual approach, we construct a set of parameterized 
three-dimensional cluster models. The angular diameter distance to the cluster is included as 
one of the parameters. For any given set of parameter values, we can predict the observations 
from the model. We define a function quantifying the differences between the actual 
observations and these predictions, and minimize it to find the best fit parameters. 

As always, the trick is to choose the set of models well. If the set is not general enough, 
we will get deceptively tight constraints on the distance, based as much on the restricted 
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choice of models as on the data. Any systematic differences between the best-fitting model 
and the data may also translate into systematic errors in our estimation of the distance and 
elongation. On the other hand, if the set is too general, it will be impossible to constrain 
the parameters. 

To begin, we will discuss the appropriate symmetry assumptions for elongated clusters 
in §3.1. We will argue that an ellipsoidal mass distribution is the best choice. In §3.2, we 
will discuss the constraints on the mass distribution from the weak gravitational lensing 
observations. We will then add a hydrostatic equilibrium model for the gas and define the 
function in §3.3 

3.1. De-projection and Symmetry 

Projection always involves loss of information. Therefore, de-projection requires sym- 
metry assumptions. Since we are interested in elongated clusters, it is natural to consider 
ellipsoidal symmetry. Analytically, the de-projection is simplest if the gas properties, and 
thus the gravitational potential, are constant on a family of similar, concentric, coaxial ellip- 
soids. Unfortunately, there is no guarantee that a given ellipsoidal potential will correspond 
to a physically sensible mass density distribution, p = V"^ cj) / AttG . In particular, for isother- 
mal gas, the ellipsoidal generalization of the usual /3-model for cluster X-ray emissivity has 
a potential 

0OC A;Tlog(l+m^). (15) 

However, for prolate ellipsoids with axis ratio greater than •\/3/2, the corresponding density 
distribution becomes dumbbell-shaped, which is unlikely for a cluster in equilibrium. For 
oblate ellipsoids with axis ratio less than ^J\J2^ the density actually becomes negative in 
some regions. Thus, while the ellipsoidal potential model was useful for demonstrating the 
distance-elongation degeneracy and its resolution analytically in §2, it is not necessarily the 
best model to use in practice. 

Therefore, we will instead assume that the total mass density is constant on similar 
ellipsoids. This ensures a sensible mass distribution. However, it also means that the isopo- 
tentials will not in general be ellipsoids. In addition, the shapes of the isopotentials will 
depend not only on the shape of the isodensity contours, but on the density profile, p(vr?\ 
as well. Thus, we cannot construct a model to predict the X-ray and SZ effect observations 
until we have determined the density profile. Consequently, we must break the construction 
of the model into two steps. First, we construct a set of possible mass models, including 
both the shape and the density profile, as constrained by the gravitational lensing observa- 
tions. For each mass model, we calculate the gravitational potential. Second, we add the 
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intracluster gas in hydrostatic equilibrium. This allows us to predict the X-ray and SZ effect 
observations, and to adjust the free parameters of both the mass and gas models to find the 
best fit. 

Jing & Suto (2002) have fit triaxial models to the mass within isodensity contours 
of simulated clusters. They find that the orientation of the axes is relatively stable as a 
function of density, though the central regions tend to be more elongated. However, to 
keep the number of model parameters to a minimum, we use ellipsoids of fixed shape and 
orientation. 



The shape of a triaxial eUipsoidal mass distribution is described by a pair of axis ratios. 
The orientation relative to the observer's line of sight is specified by three Euler angles. These 
five parameters, together with the density profile, p(m^), characterize the mass distribution. 
There is an additional arbitrary choice of scale in the definition of m^. However, a rc-scaling 
of p{'rrP') can always compensate for a change in the choice of scale, so the scale is not an 
additional parameter. 

The weak gravitational lensing observations measure the convergence, k,{6) = E(0)/Ecr. 
The critical surface density depends on Hq, but the product DiT^cr does not. Thus, we can 
measure the product 



For an ellipsoidal mass distribution, p{rn?), the projected mass density will be constant 
on a family of similar ellipses, parameterized by a single axis ratio and a single orientation 
angle. By fitting an elliptical projected mass distribution to the lensing data, we obtain 
these two parameters, together with D^E as a function of the elliptical coordinate, &, defined 
analogously to the ellipsoidal "radius" , m. Uncertainties in the lensing data will of course 
allow for some variation in these quantities. For now, however, we will assume that they are 
fixed. 

Since the shape and orientation of the ellipsoidal distribution are described by five 
parameters, there will still be three free parameters required to uniquely determine the 
shape and orientation of the mass distribution, even with perfect lensing information. First, 
consider a particular member of this three-dimensional parameter space; suppose that the 
surfaces of constant density arc prolate axisymmctric ellipsoids generated by rotating the 
elliptical contours of projected mass about their major axis. Then, we can de-project to find 



3.2. Gravitational Lensing and the Mass Model 




(16) 
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the density profile Dfp{m ) in the same way one de-projects a spherical distribution. 

To obtain the rest of the possible eUipsoidal de-projections, we consider the three famihes 
of transformations which map ellipsoids into ellipsoids without changing the projected mass 
distribution. Specifically, one family corresponds to a stretching along the line of sight, 

^ p'(x) = Z'lX^'), (17) 

where 

x'=(0',C') = (^,C/^mass), (18) 

and Ziaass is the elongation of the isodensity contours relative to the prolate case. 
The other two families correspond to shear transformations, 

p(x)^p'(x)=p(x'), (19) 

where 

x' = (0',C') = (^,C + s-^), (20) 
characterized by a 2-vector, s, in which the mass is displaced along the line of sight by an 
amount, s ■ 6, proportional to 6. 

The stretching transformation does not commute with the shearing transformation, so 
wc must specify the order in which they are applied. For a given elliptical projected mass 
distribution, we obtain a particular de-projection from the canonical prolate case by shearing 
it by s, and then stretching it along the line of sight by Z^ass- 

Wc will test various values of .^mass and s, and use the X-ray and SZ effect observations 
to determine the best values. All the projected obscrvables are invariant under a reflection 
through the plane of the sky, which corresponds to s ^ — s, so it is impossible to tell which 
end of the cluster is pointing at us. 

Ellipsoidal clusters axisymmetric about the line of sight, for which s = 0, have an axis 
ratio equal to .^mass- In the general case of triaxial ellipsoids of arbitrary orientation, that 
will no longer be true. The axis ratios will depend in a complicated manner on the axis ratio 
of the elliptical projected mass distribution, the magnitude and orientation (with respect to 
the projected ellipse) of the shear, s, and on Zmass- 

To be more precise, the shear transformation preserves the thickness, 

AC(m, 6) = 2Vm2 -6 S O, (21) 

of the canonical prolate eUipsoid at fixed 0. However, it increases the total extent, 

A({m) = maxC(m, 6) — min ({m, 6) — 2 maxC(m, 0), (22) 
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in C spanned by the ellipsoid. Thus an ellipsoid with Z^ass — 1 has the same thickness as 
the canonical prolate ellipsoid, but can have a much larger extent in the ( direction. The 
subsequent stretching increases both the thickness, A([m,d), and the extent, A(^(m), by a 
factor of Zinass- Nonetheless, for lack of a better term, we will continue to refer to .^mass as 
the line of sight elongation. 

Finally, note that the isopotentials of an ellipsoidal mass distribution are rounder than 
the isodensity contours (and are in general not ellipsoids). Thus, even in the s = case, 
the typical elongation, Z of the isopotentials, which determines the ratio of true distance 
to that inferred from the spherical method, will be smaller than the axis ratio of the mass 
distribution. Conversely, for clusters in hydrostatic equilibrium, the axis ratio of the mass 
distribution will be greater than the typical values estimated from the shapes of X-ray 
isophotes. 

The calculation of the potential of an ellipsoidal mass distribution has been extensively 
discussed (see, e.g., Chandrasekhar 1987; Binney & Tremaine 1987). Since we are working 
in coordinates in radians, 

V^0(x) = Dfvmr) = AnGp{^)Df (23) 

so the potential, 0(x), is defined in terms of the combination p{'x)Df which is determined by 
the observations, independent of Hq. It is convenient to choose the zero point of the potential 
at the center of the cluster. Note that the hydrostatic equilibrium condition, equation (4), 
has the same form in terms of the dimensionless x, as long as the pressure is also differentiated 
with respect to x. 

3.3. Gas Model and the Chi-Squared Function 

Once we have the cluster potential, 0(x), we can add the intracluster gas. In hydro- 
static equilibrium, the gas can be described by its temperature profile, T{(f)) and the central 
electron density, ngo- To minimize numerically, we need a discrete parameterization for 
the temperature profile. We let log 7(0) be a polynomial of fixed order in 0. 

We then integrate equation (10) to obtain the density profile, ne(0). From the density 
and temperature profiles, we can calculate the electron pressure and X-ray emissivity in each 
energy band. Using 0(x), and including a factor of D for the path length, we can project 
to predict the SZ effect temperature decrement and the X-ray surface brightness. Given a 
model of the instruments used to observe the cluster, we can convert these predictions to 
the measured quantities. 
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The predictions depend on the various parameters of the model. They depend nonhn- 
early on the elongation and shear, ^mass and s, which fix the shape and normalization of 
the deprojected mass distribution, as well as on the central electron density, ngo, and the 
temperature profile, r(0). Finally, they are proportional to the angular diameter distance, 
D, to the cluster. 

We construct a function for the X-ray and SZ effect measurements in the usual way, 
weighting the square of the differences between the observations and the model predictions by 
the reciprocal of the variance of each measurement. We apply the chain rule to calculate the 
partial derivatives of with respect to the gas parameters and the distance. The dependence 
on the shape of the cluster is more complicated. We therefore break the x^ minimization 
problem into an inner part and an outer part. For a fixed set of shape parameters, we 
minimize with respect to the gas parameters and distance to find the best fit for that shape. 
This minimization defines an integrated x^, 

X^(^mass, S) = min X^ (^mass, S, 11^0, T(0), D) . (24) 
neO,T(0),D 

Then, we minimize this integrated x^ with respect to the elongation and shear, Zmass and 
s, which determine the shape of the cluster. For the inner minimization with respect to 
the gas parameters and distance, we use the Levenberg-Marquardt nonlinear chi-squared 
minimization algorithm, which takes advantage of the derivative information. 

Note that the lensing observations are not included in the x^ function. We already 
had to use them to fix the projected mass density, and thus determine 0, up to the three 
remaining shape parameters. Without this, we could not have made predictions for the 
projected gas observables. This special role for the lensing data is a direct consequence of 
our choice of symmetry assumption for the mass distribution; the shape of the isopotentials 
contours depends on the mass profile, and not just on the shape of the mass distribution. 

As a result, the effect of uncertainties in the lensing observations must be considered 
separately. They will lead to some range of acceptable parameters in the fit to the projected 
mass information in §3.2. By repeating the inversion procedure for different values of these 
parameters, we can test the effect on the angular diameter distance. 

4. Test Cases 

To test our de-projection method, we apply it to artificial observations of model clusters, 
to see how well we recover the original shape and distance. We will refer to these clusters 
as the true clusters. Our true clusters, like the models used to fit them, have ellipsoidal 
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mass distributions, so that we know a priori what the best fit shape ought to be in the 
absence of noise. The purpose of this test is to determine how sensitive the method is to 
the uncertainties in the observations. In particular, we want to ensure that there were no 
additional degeneracies which our analytic treatment in §2 might have missed. 

For the mass density profile, p, as a function of our angular ellipsoidal coordinate, m, 
we use a /3-model, 



p{m) = p(0) 



1 + 





21 -3/3/2 



2 

mf 



(25) 



with core rriQ. We choose f3 = 2/3. While such a model with a core is generally more 
appropriate to the X-ray gas than to the cluster matter distribution, it should be sufficient 
for testing purposes. This choice also reduces the number of integrals which must be done 
numerically to calculate the potential. 

For the X-ray gas, we assume a polytropic temperature profile with a central temperature 
T = 8.0 keV and a polytropic index, n, equal to 1.1. We assume a fully ionized hydrogen- 
helium plasma with a hydrogen fraction of 0.76 by mass. We take the central electron density, 
rieo, equal to 1.0 x 10~^cm~^. The density normahzation only affects the signal-to-noise of 
the gas observations. 

We use two sets of true clusters. First, we consider ellipsoidal clusters axisymmctric 
about the line of sight, which we fit with models with the same symmetry. This allows us to 
treat the observables as functions of one variable, the angular radius, 9. It also reduces the 
shape parameter space from three dimensions to one, the mass elongation, Z^^ss relative to 
spherical. Both changes greatly reduce the CPU time required for the inversion. We consider 
clusters with different elongations, .^mass, in the mass distribution, relative to spherical, 
ranging from 0.6 to 1.4. We normalize each density profile to produce the same projected 
mass, regardless of the elongation. Thus, we are testing whether the X-ray and SZ effect 
measurements allow us to distinguish between clusters with identical lensing observations. 
Recall that the elongation, Z, of the potential, which directly affects the inferred distance, 
is generally smaller than Zmass- 

Second, we consider the more general case of clusters of arbitrary shape and orientation. 
For this case, we use three true clusters: two triaxial clusters and one axisymmetric but with 
its axis inclined to the line of sight. Their shapes are listed in Table 1. We observe the two 
triaxial clusters along two different lines of sight, and the axisymmetric cluster along one. 
Table 2 lists the trial observations. The fines of sight are specified by spherical coordinates: 
the inclination, i, with respect to the third (c) axis, and the azimuthal angle, 0, with respect 
to the first (a) axis. 
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Table 1: Triaxial Test Cases 
Shape name Axes 

{h-^ Mpc) 
a b c 
1.2 0.8 2.0 
B 2.0 1.0 1.0 

C 0.7 1.4 2.0 

Table 2: Trial Observations of Triaxial Test Cases 
Observation Shape*^ Line of sight direction 

inclination azimuthal angle 0*^ 







(deg) 


(deg) 


Al 


A 


30 


40 


A2 


A 


60 


75 


B 


B 


60 





CI 


C 


30 


40 


C2 


C 


60 


75 



"see Table 1 

''with respect to the c axis of the ellipsoid 
'^with respect to the a axis of the ellipsoid 

When choosing the model length and density scales, it is sometimes easier to think 
in terms of the physical quantities. We choose the physical core length scale rrioD to 
be 0.1 Mpc. For the clusters axisymmetric about the hue of sight, we choose a cen- 
tral density, p(0), equal to 4 x 10^^Z~^g /i^M©/ Mpc^. For the other clusters, p(0) = 
4 X 10-*^^ H^Mq/ Mpc^, regardless of shape and orientation. 

Of course, to convert these physical quantities to the i/o-independent quantities actually 
used in the model, we need to specify the scaled distance, hD. We will take the redshift, 
of our clusters to be 0.2. This is far enough away to make weak lensing measurements 
feasible, yet close enough for X-ray instruments such as Chandra to produce high signal-to- 
noise maps in many energy bands. We assume a flat ACDM cosmology with — 0.3 and 
Ao = 0.7, which gives D — 476 Mpc. Then, we calculate mo and p{0)D'^. 

Note that for a given choice of the physical quantities, the angular scale of the cluster, 
will vary slightly with cosmology, as will the normalizations of the X-ray and SZ effect 
observations. Thus, the signal-to-noise of these observations does depend slightly on our 
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choice of cosmology. However, the angular scale and signal-to-noise would also vary from 
cluster to cluster in a given cosmology. In a different cosmology, the same calculation with 
the same angular scale and signal-to-noise would simply correspond to a slightly different 
physical cluster, but would not be fundamentally altered. 

When we project the local electron pressure and X-ray emissivity to create the artificial 
observations of the intracluster gas, we also need the actual distance, D. We take h = 0.7, 
so D = 681 Mpc. Again, given the choice of the physical mass profile parameters, this will 
affect the signal-to-noise. Of course, neither this assumed value of h nor that of D is used 
by the de-projection procedure. 

5. Artificial Observations 

Next, we need to create artificial observations of these cluster models, both the true 
clusters and the trial models used in the fitting. Our goal is to see how the observational 
noise and uncertainties affect our ability to recover the true shape and distance. We do 
not attempt to model in detail all of the properties of specific instruments. Instead, we 
use simplified models with roughly the same sensitivity. The possibility of creating high 
signal-to-noise weak lensing and SZ effect maps is relatively new. Therefore, we allow for 
some improvement over the current state-of-the-art in these observations. The field of X-ray 
astronomy is comparatively mature, so we base our model more closely on current instrumen- 
tation. We add noise to the true cluster observations but not to the trial models. Otherwise, 
we use the same models to create both sets of artificial observations. 



5.1. Weak Lensing 

As we mentioned in §3.3, incorporating uncertainties in the weak gravitational lensing 
measurements requires additional Monte Carlo testing of the entire de-projection analysis. 
Because of this, as well as rapid advances in the weak lensing observations, we have entirely 
ignored the weak lensing uncertainties in our initial tests. Instead, we use the exact de- 
projection of the density profile of the true cluster. In doing so, we are also not restricting 
ourselves to using only the weak shear information. Thus, while we showed in §2.4 that 
the elongation could be recovered from the weak shear alone, our numerical tests do not 
demonstrate this. 
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5.2. 



SZ Effect 



We base our artificial SZ effect observations roughly on the maps made by fitting the 
Owens Valley Radio Observatory (OVRO) millimeter-wave interferometer with 30 GHz (1 cm 
wavelength) receivers (Carlstrom, Joy, & Grego 1996). At this frequency, the proportionality 
constant between the temperature decrement and the Compton y-parameter in equation (2) 
differs shghtly from —2. Relativistic corrections (Rephaeli 1995; Challinor & Lasenby 1998) 
also modify the proportionality constant. However, the total correction is roughly 5% (Patel 
et al. 2000), so we ignore it. 

A two-element interferometer measures the complex visibility. 



a Fourier component of the intensity, /, where u = Ar/A is the baseline separation of the 
antenna elements in units of the wavelength, A. Equation (26) is valid as long as the sepa- 
ration, Ar, between the antennas is much larger than their diameter, which is usually the 
case in radio interferometry. The Fourier transform of this sparse Fourier-space sampling is 
called the "dirty" beam. Taking the Fourier transform of the visibilities results in a "dirty" 
image which is the convolution of the true intensity with the dirty beam. Convolution tech- 
niques are used to recover a smooth, "clean" image with the equivalent resolution. Clusters 
of galaxies, however, are large compared to the angular resolution of interferometric arrays. 
The ability to measure the temperature decrement on large scales is limited by the shortest 
baselines. Therefore, these observations are generally made with the array in a nearly close- 
packed configuration to maximize sensitivity to the SZ temperature decrement on cluster 
scales, with some longer baselines for detection and removal of point sources. For OVRO, 
this system produces a roughly 1' beam and maps with an rms noise of ~ 27 /jK per beam 
in the temperature decrement with 60 hr of observations (Patel et al. 2000). We adopt this 
resolution and sensitivity. 

Such close-packed configurations produce "dirty" beams which are actually smooth, 
so the data can be analyzed in real space (Pen et al. 2002). The temperature decrement 
measurements have Gaussian white noise, so the noise remains uniform and Gaussian when 
transformed to real space. The physical separation of the dishes must be greater than the 
diameter of the individual dishes. This hmits the shortest baselines measured, which results 
in a hole in the Fourier space coverage. In real space, the effect of this hole is approximately 
equivalent to subtracting the diffraction limited image which would be produced by a single 
dish from an image with the diffraction limit of the entire array. This gives an effective 
beam which has a total area of zero, as appropriate for interferometric observations with 




(26) 
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no measurements at zero-spacing. For the OVRO array, each dish has a 10.4 m diameter, 
corresponding to a diffraction hmit of 4' at 30 GHz. 

For simphcity, in the axisymmetric case, we hmit the field of view of our temperature 
decrement observations to 3' in radius, covered with three 1' annuh, and remove the short 
basehnes by subtracting the average temperature decrement over this field from the individ- 
ual anmili. This simple approach produces the correct spatial resolution and noise per pixel, 
but it reduces the field of view and the total number of pixels. It also ties the diameter of the 
field of view to the dish size, preventing us from changing one without changing the other. 

For clusters with arbitrary orientation, the field of view is no longer circular, so this 
approach no longer makes sense. Therefore, we use an 8' x 8' field of view, with a rectangular 
grid of 1' pixels, to represent the image with the diffraction limit of the entire array. We 
then convolve this image with a Gaussian of unit area and full width at half maximum equal 
to the single dish diffraction limit and subtract the convolved image from the original. 

5.3. X-ray Observations 

For our artificial X-ray observations, wc use a simplified model of the Advanced CCD 
Imaging Spectrometer (ACIS) on the Chandra X-ray Observatory as described by the Chan- 
dra Proposers' Observatory Guide (2001). The four front-illuminated CCD chips of the 
ACIS-I imaging configuration have 0.492" pixels over a 16.9' x 16.9' field of view. The half- 
energy radius varies roughly from 1 to 10 pixels from the center of the field of view to an 
off-axis angle of 8'. The full spatial resolution is unnecessary for our purposes, so in the 
axisymmetric case, we use 50 annuli, each 9.6" wide, out to a radius of 8'. In the case of 
the clusters of arbitrary orientation, we use a 20^ rectangular grid covering and 8' x 8' field 
of view (one-quarter of the Chandra ACIS-I field), giving 24" x 24" pixels. This resolution 
is quite crude, but is necessary to ensure that the chi-squared fitting can be performed in a 
reasonable amount of CPU time. 

For simplicity, we ignore X-ray line emission and consider only thermal Bremsstrahlung. 
The ACIS energy resolution varies with energy and position on the chips. However it is 
generally < 0.3 keV. Wc create maps in 30 energy bands of this width between IkeV and 
10 keV. To calculate the count rates, we multiply the X-ray surface brightness for each band 
by the angular area of each annulus and by the effective area of each band. We assume an 
effective area of ^4 = 500 cm^ for £" < 2 keV, 300 cm^ for 2 < < 5.5 keV, and a hnear falloff 
in log A to 14.3 cm^ from 5.5 keV to lOkeV. This approximates the on-axis effective area of 
the front-illuminated CCDs, while simplifying the detailed edge structure. For simplicity, we 
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also ignore the change in effective area with off- axis angle, which is roughly a 15% decrease 
at the edge of our field of view. 

We add a contribution to the count rates from the background (both the actual diffuse X- 
ray background and the charged particle background) of 0.07 counts/chip/ sec for < 2 keV 
and 0.15 counts/chip/ sec for energies from 2keV to lOkeV as estimated from the on-orbit 
measured background for the ACIS-I chips (Chandra Proposers' Observatory Guide 2001). 
For simplicity, we add the same background to the count rates for the trial models, rather 
than adding a background term whose normalization is a free parameter of the fit. 

We multiply the count rates by an assumed lOOksec observation time to find the ex- 
pected number of counts in each band and annulus, and add Poisson fluctuations to these 
counts. Note that when we compute the Poisson uncertainties in the number of counts 
for the function, we use the mean values. With actual observations, only the measured 
number of counts including noise would be known. This would introduce a bias to actual 
observations, which would have to be corrected. 

6. Numerical Details 

The projected gas observables, the X-ray surface brightness and the temperature decre- 
ment, are approximated by discrete sums as described in the Appendix. 

Our true clusters have ellipsoidal mass distributions, so the projected mass distributions 
are elliptical. Since we are not including any uncertainties in the lensing information, it is 
not necessary to fit an elliptical model to the projected mass and de-project it explicitly for 
each assumed shape. The de-projected mass profile will differ from the true profile, p(m^), 
only by the ratio of the true mass elongation, Zmass, to the assumed value. 

For clusters axisymmetric about the line of sight, our shape parameter space is reduced 
to a single dimension, the elongation, .Z'jnass- For simplicity, we perform the outer minimiza- 
tion of with respect to elongation by a simple grid search. We evaluate the integrated 
X^(-^mass) on a uniform grid of 33 values of assumed elongation in a range of ±0.2 about the 
true elongation, and pick the elongation which gives the lowest chi-squared. 

In the case of clusters of arbitrary shape and orientation, such a brute force grid search is 
no longer feasible. Instead, we use a general-purpose multidimensional minimization routine 
based on Powell's method. To ensure that the minimization routine has converged, we restart 
it twice at the previous minimum. In practice, there was no significant change in the solution 
after the first restart. 
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To find the initial guess for this minimization routine, we perform a low resolution grid 
search over a 4 by 2 by 2 grid of cells. We initially evaluate the integrated at the center 
of each cell and find the cell with the smallest value. We then refine the grid by evaluating 
in a 3 by 3 by 3 grid within this cell. The shape with the smallest integrated is used 
as the initial guess for the minimization routine. 

As mentioned earlier, we need a discrete parameterization for the temperature profile. 
We take logT(0) be a polynomial of fixed order in 0. in the axisymmetric case, we used a 
fourth order polynomial. In the general case, we found that a third order polynomial was 
more effective in avoiding unphysical oscillations in the temperature profile in the initial 
guess for the cluster shape was poor. 

The inner minimization, with respect to the gas parameters and the distance, also 
requires an initial guess. We guess a constant temperature profile of 7.7 keV, roughly equal 
to the true central temperature of 8keV. In the axisymmetric case, we take the central 
electron density to be twice the true value, and the distance to be 0.833Z~^gg (assumed) of 
the true value. In the general case, we take the central density to be 1.5 times the true 
value, and the distance to be 1.1 times the true value. As long as the initial guess for the 
temperature is not too extreme, the code rapidly converges to essentially the same best fit set 
of parameters, regardless of the choice of initial guess. Using a fairly accurate initial guess for 
the central temperature is realistic since the central temperature of a relaxed cluster should 
be reasonably well determined from the overall emission-weighted spectrum. 

For each test case, we create 20 realizations of the artificial observations with noise, 
which are analyzed independently. This allows us to estimate the uncertainties in the param- 
eters inferred from a single observation due to statistical uncertainties in the measurements. 

7. Results 

7.1. Axisymmetric Clusters 

First, we show the results of fitting the true clusters which are axisymmetric about the 
line of sight, with models of the same symmetry. Figure 1 shows the mean values of the 
inferred mass elongation for each true elongation. The error bars are the lex uncertainties for 
a single realization, calculated from the sample variance of the 20 realizations. The diagonal 
line shows Zmagg (inferred) = Z^^si^rwe). The mean elongations are consistent with no bias 
relative to the true elongations (with a Scr upper hmit of 1 to 2%). The uncertainties for 
a single realization range from 1 to 3%. Since we plot mean values (to test for systematic 
bias), the deviations from the true elongation are of course much smaller than the error bars. 
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which represent the uncertainty in a single realization. 

The inferred elongations shown are from simultaneous fits for the elongation, distance, 
central density, and temperature profile. However, we obtain nearly identical results if 
we use only the X-ray and lensing data, and fit for elongation, temperature profile, and 
the combination nl^D. This is to be expected from our analytic analysis in §2.4, where 
we saw that constraining elongation required the absolute temperature and the relative 
temperature and density profiles. The SZ effect by itself only constrains the product of 
the two profiles. Nor can it constrain the absolute temperature, if Hq and thus the cluster 
distance arc unknown. Therefore, the elongation is likely to remain determined primarily by 
the X-ray and lensing observations, even with improvements in the temperature decrement 
measurements. 

In Figure 2, we show the inferred distance for clusters of each true elongation. Again, the 
values and error bars represent the mean over the 20 realizations, and the la uncertainties 
of a single reahzation. The dashed hue indicates the true distance of 681 Mpc. There is 
no significant bias (with a 3(T upper limit of 1 to 2%), and the uncertainties in a single 
reahzation are of order 2 to 3%. 

7.2. General Case 

Next, we fit triaxial models to the second set of true clusters described in §4, those 
with arbitrary orientation and shape. For each observation listed in Table 2, we created 20 
realizations of the noise, and analyzed each one independently. Figure 3 shows the inferred 
angular diameter distance for each observation. The values represent the mean of the 20 
realizations. The error bars are the la uncertainties due to instrumental noise for a single 
realization, calculated from the sample variance, which range from 3 to 6%. The dashed line 
indicates the true distance of 681 Mpc. The mean distances are consistent with no bias, with 
3cr upper limits ranging from 2 to 4%. 

The true elongation, Z^aass is recovered to 2 to 4%. The absolute errors in the shear 
ranged from 0.05 to 0.10. 
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Fig. 1. — Inferred vs. true mass elongation, ^mass) ^ estimated from 20 realizations of 
the artificial observations. The error bars are the sample standard deviation of the 20 
realizations, corresponding to the la uncertainties for a single realization. 
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Fig. 2. — Inferred distance for each true mass elongation, Zj^^ss as estimated from 20 real- 
izations of the artificial observations. The error bars are the sample standard deviation of 
the 20 realizations, corresponding to the la uncertainties for a single realization. 
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Fig. 3. — Inferred distance for each trial observation (see Table 2) as estimated from 5 
realizations of the artificial observations. The error bars are the sample standard deviation 
of the 5 realizations, corresponding to the la uncertainties for a single realization. 
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8. Discussion 

8.1. Statistical Uncertainties 

The variation among realizations measures the sensitivity of the inferred distance to 
statistical uncertainties in the X-ray and temperature decrement observations. By examining 
correlations between the fractional errors in the distance and in the mass elongation, it 

is possible to show that the uncertainty in the X-ray temperature makes the dominant 
contribution to the uncertainty in the distance in the case of clusters axisymmetric about 
line of sight (Fox 2001). In the general case, however there was no obvious correlations 
between the distance and the shape parameters, Z^nass and s. 

As we noted in §5.1, we have not attempted to model the uncertainties in the gravita- 
tional lensing observations. Additional Monte Carlo tests with realistic treatment of these 
uncertainties, and their effects on the fitting and de-projection of the surface mass distribu- 
tion, should be performed. 

8.2. Systematic Errors 

Besides the statistical measurement uncertainties, there are number of systematic effects 
which could affect our determination of the cluster distances and of Hq. Many of these effects 
are also relevant to the traditional method based on a spherical model. For recent reviews 
of the systematic uncertainties, see Carlstrom et al. (2001) and Molnar, Birkinshaw, & 
Mushotzky (2002). 

We will focus primarily on those effects which are particular to our method. We first 
discuss the dependence of the inferred Hq on the magnitude of the observed quantities, and 
the resulting bias due to either contamination of these observations or to errors in their 
absolute calibration. Then, we consider aspects of the state of the intracluster gas which 
could violate the underlying physical assumptions of our model. Finally, we mention some 
caveats regarding our assumption of ellipsoidal symmetry. 

8.2.1. Normalization Dependence 

The distance determination depends on the normalizations of the measured temperature 
decrement. X-ray surface brightness, and X-ray temperature. Therefore, calibration errors 
or other systematic errors in these observations will directly affect the value of Hq inferred. 
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In the traditional method, 

^0 oc A-„Led TtS^' (27) 

assuming that the value for Hq has been averaged over sufficiently many clusters of random 
orientation to remove the effect of the elongation. For a single cluster, there would be an 
additional factor of the inverse of the gas elongation, Z. In our method, Hq can, in principle, 
be inferred from a single cluster by correcting for this factor. However, this correction 
introduces an additional dependence on the ratio of the potential, 0, determined from weak 
lensing observations, to the X-ray temperature. Prom equations (13) and (14), we can show 
that the inferred gas elongation, 

Z«5^. (28) 

e 

Recall that the product A is independent of the distance to the lensing cluster, Di, 
and depends only on the cosmology and the redshifts of the cluster and the lensed sources. 
Combining equations (27) and (28), we see that for our method, 

[/XIcmb) 

Thus, our method and the traditional method have the same sensitivity to errors in the 
normalization of the X-ray surface brightness or the temperature decrement. Possible sources 
of contamination of these measurements, as well as uncertainties in their calibration, are 
discussed by Carlstrom et al. (2001). 

Due to the fortuitous cancellation of one power of the electron temperature between 
equations (27) and (28), we expect our method to be less sensitive to the electron temperature 
than the traditional method. Nonetheless, the inferred value of Hq will still be directly 
proportional to the absolute calibration of the X-ray temperature, as well as that of the 
surface brightness. 

Our method introduces a new dependence on the calibration of the weak lensing obser- 
vations. Our inferred Hq is now proportional to the normalization of the lensing potential, 
ip, (or its second derivatives, the convergence, k, and the shear, 7) as well as to 

For measurements of weak shear from the shapes of background galaxies, the normalization 
of the inferred shear depends on accurately correcting for the seeing conditions. Calculating 
Ecr-D/ requires knowledge of the redshift distribution of the background galaxies, and of the 
choice of cosmology {flo and Aq) which affect the shape of the redshift-distance relation. Note 
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that all lensing measurements depend on Ed- Therefore, it may be possible to eliminate the 
cosmology dependence if the redshift distribution is itself inferred from lensing measurements. 
Several techniques have been suggested for doing so (Small et al. 1994; Kneib et al. 1994, 
1996; Bartelmann & Narayan 1995). 

Our analytic models in §2 demonstrated that the elongation-distance degeneracy could 
be broken without resolving the mass sheet degeneracy in the lensing observations. However, 
while we have proven this in principle, we have not yet demonstrated it in practice. In 
fact, our assumption in §3.2 that the product DiL was known presumes that the mass 
sheet degeneracy has been resolved. This could be done by measuring the convergence. 
Alternatively, we can assume that the surface density falls to negligible values far from 
the center of the cluster. Given shear observations over a wide enough field, this may be 
sufficient. Finally, the method might be reformulated to remove any dependence on the mass 
sheet degeneracy. 



8.2.2. The Physical State of the Intracluster Gas 

In comparing n1 inferred from the X-ray observations with Ue inferred from the temper- 
ature decrement, both our method and the traditional one assume that the X-ray emission 
and the scattering of the microwave background photons is due to the same population of 
electrons. If the intracluster gas is not smooth but rather clumpy, with C = {v?^ / {ueY > 1, 
then the inferred distance to the cluster will be systematically underestimated by a factor 
of C (Birkinshaw et al. 1991). In addition, a multiphase medium introduces additional de- 
grees of freedom which may make it impossible to constrain the elongation. The gas in the 
cooling flows observed in the centers of some clusters is in fact believed to be multiphase. In 
applying the our method to real clusters, it would be advisable to remove any cooling flow 
region from the X-ray maps and fit to the remaining data. 

Our method of inferring the elongation depends on the assumption that the intracluster 
gas is in hydrostatic equilibrium in the cluster potential well. A distinct advantage of the 
traditional method is that it does not rely on this assumption. The characteristic time 
for a cluster to reach hydrostatic equilibrium is the sound wave crossing time. The high 
temperature of the intracluster gas makes this time scale much shorter than the Hubble 
time (Sarazin 1988). However, clusters which have undergone recent mergers may not have 
had time to fully relax. Mergers tend to increase the X-ray luminosity of clusters, so cluster 
samples selected by luminosity will tend to include an enhanced fraction of mergers (Roettiger 
et al. 1997). Deviations from hydrostatic equilibrium could introduce either random or 
systematic errors in our elongation correction. 
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We have also implicitly assumed that thermal pressure is the dominant source of support 
for the gas. Other possible sources include rotation, magnetic or turbulent pressure, or 
pressure from a population of relativistic electrons and ions. If these were significant, our 
method would give incorrect results for the elongation and the distance. 

It is essential to quantify these possible errors by applying the method to a set of 
simulated clusters. 



8.2.3. Symmetry and Modeling Assumptions 

Real clusters will differ from the models we use to fit them, and this may also introduce 
both random and systematic errors. Typically, measurements of Hq from X-ray and SZ effect 
observations have used a beta model to fit the gas density distribution. Often, in the absence 
of spatially resolved X-ray temperature mcasiu^cmcnts, an isothermal temperature has also 
been assumed. A number of authors have estimated the systematic effects in the traditional 
method due to deviations from these model assumptions, as well as that of spherical sym- 
metry (Sulkanen 1999; Inagaki et al. 1995; Roettiger et al. 1997; Puy et al. 2000). We have 
tried to take advantage of the improved spectral resolution of the latest X-ray telescopes, 
and of anticipated improvements in observations of the SZ effect, to relax these assumptions 
about the density and temperature profiles of the gas, allowing them to the constrained only 
by the observations and by the assumption of hydrostatic equilibrium. 

Even so, de-projecting clusters still requires a fairly strong assumption about symme- 
try. We have assumed that the cluster mass distribution is constant on families of similar, 
concentric, coaxial ellipsoids. Our hope is that such a model will be sufficient for inferring to 
first order the deviation from spherical symmetry due to elongation. However, X-ray clusters 
often show evidence for significant irregularity and substructure (e.g., Jones & Forman 1992; 
Mohr et al. 1995). If the deviations from our model arc large enough, it may fail to fit the 
observations, or, worse, give misleading answers. For example, our model would be unlikely 
to give correct results when applied to a cluster which was actually two clusters along the 
same line of sight (unless one of the two clusters dominated all three of our observational 
probes, or unless the two shared a common envelope). Selection effects in the detection 
of gravitational lensing could bias a sample of clusters toward such cases. Even fairly reg- 
ular clusters will not have the perfect ellipsoidal symmetry we have assumed. They may 
be elongated but not ellipsoidal. They may be roughly ellipsoidal, but with smaller scale 
irregularities, or with axis ratios and orientations which vary with radius. Again, the best 
way to quantify the resulting uncertainties, both random and systematic, is to apply the 
method to a sample of simulated clusters, with realistic modeling of the selection effects. 
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Mass in filaments and other correlated structure near clusters contributes significantly 
to the total projected mass measured by lensing, and to its scatter, though the magnitudes 
of both the bias and the scatter depend on the particular mass estimator used (Metzler et 
al. 1999; Metzler, White, & Loken 2001). By applying our method to numerically simulated 
clusters, we will be able to tell how much of an effect this has on the inferred distances. 

9. Conclusions 

X-ray and SZ effect observations provide a geometric method of measuring the distance 
to clusters of galaxies, independent of the usual distance ladder calibrations. This method 
can be used to measure the rcdshift-distancc relation and inferred the value of Hq. However, 
the method is subject to an exact degeneracy between the distance and the elongation of 
the cluster along the line of sight, relative to its size perpendicular to the line of sight. This 
degeneracy introduces significant scatter in the measured values of i^o, requiring a large 
sample of clusters of random orientation for an accurate determination. To infer and Aq 
by the redshift-distance relation would require an even larger sample. Furthermore, if the 
sample is biased towards particular orientations due to selection effects, this would introduce 
a systematic error in Hq. 

The elongation-distance degeneracy can be broken if the intracluster gas is in hydro- 
static equilibrium, and if the projected mass density can be inferred from gravitational 
lensing measurements of background galaxies. We have developed a specific method to do 
so, using models with ellipsoidal mass distributions, and applied it to simple axisymmetric 
ellipsoidal model clusters. We recover the true shape and distance to each cluster with sta- 
tistical uncertainties of 4 to 6%, and no detectable systematic bias (with a 3(7 upper limit of 
4%), using artificial X-ray and SZ effect observations with sensitivity comparable to current 
observations. 

Further work is necessary to determine the sensitivity of the method to realistic uncer- 
tainties in the lensing observations. Particular attention should be paid to possible systematic 
uncertainties due to the mass sheet degeneracy, and to the dependence of the critical surface 
density on the redshift distribution of the lensed background galaxies. The systematic un- 
certainties in the calibration of the X-ray surface brightness and temperature observations 
and the SZ effect observations are also important. 

Finally, real clusters are not the perfectly regular, ellipsoidal objects we have assumed in 
our models, nor are they necessarily in perfect hydrostatic equilibrium. It is essential to test 
the method by applying it to numerically simulated clusters, which more closely resemble 
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real clusters, at least in their greater degree of messiness. 
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A. APPENDIX 

Since the gas properties are constant on isopotentials, the projection of a local gas 
quantity, is 

E(0) = D J dC/ (0(0,0)- (Al) 

The projection operator is linear, and we already have a finite set of n lines of sight, 0i, so 
an obvious approach is to discrctizc the local quantity, and calculate a projection matrix, 
K. We pick a finite grid in 0, 0o < 0i < ■ ■ ■ < 0m-i = 0max- Then, the projection reduces 
to a matrix-vector multiplication, 

j 

where fj = f{(pj)- Similarly, the gradient with respect to the gas parameters, a, is simply a 
matrix-matrix multiplication, 

VaE(0,)= j;]L>X,,Va/,- (A3) 
j 

To calculate the elements of the projection matrix, K, we replace /(0) in equation (Al) 
with a linear interpolation in between the grid points. When < 0o, we let /(0) = /(0o)- 
For each hne of sight, we cut off the integral at = 0max- The elements of K, depend only 
on the potential, 0(x), the potential grid, and the set of hues of sight, not on the local gas 
properties, / (or Va/)- Thus, we can use the same matrix for all X-ray bands, and for all 
trials with a single assumed cluster shape. 

The correct choice of the cutoff, 0max, is rather dehcate. Because it is proportional to 
the electron density squared, the X-ray emission along a given line of sight is dominated by 
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the contribution from isopotentials near the minimum of 0(0, C) with respect to C- If 0max 
is too large, none of the hnes of sight will have a minimum potential comparable to it. In 
that case, the gas properties at that isopotential will be poorly constrained, and minimizing 

may lead to an unphysical temperature profile. On the other hand, if 0max is too small, 
the cluster emission will be artificially cut off at some radius within the field of view. This 
radius may not be the same for both the model cluster and the true cluster, which will lead 
to systematic problems in the fitting. In addition, fixing 0max to a single value for all models 
tends to bias the shape fitting. More elongated de-projections have shallower potential wells, 
so a fixed grid in effectively reduces the angular resolution of these models, making their 

values artificially large. 

To avoid all these difficulties, we want to choose ^max for each model shape so as to 
ensure that the outermost lines of sight in the field view are tangent to (p — ^max- In the 
axisymmetric case, this simply means setting 0max equal to the minimum of the potential 
along a line of sight with the maximum radius. In the general case, we evaluate the minimum 
of the potential along the lines of sight at each corner and in the center of each edge of the 
field of view, and choose (^max equal to the largest such minimum. We use a uniform grid in 
the potential with 20 elements between and 0max- 
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